Advanced Methods for Glottal Wave Extraction
نویسندگان
چکیده
Glottal inverse filtering is a technique used to derive the glottal waveform during voiced speech. Closed phase inverse filtering (CPIF) is a common approach for achieving this goal. During the closed phase there is no input to the vocal tract and hence the impulse response of the vocal tract can be determined through linear prediction. However, a number of problems are known to exist with the CPIF approach. This review paper briefly details the CPIF technique and highlights certain associated theoretical and methodological problems. An overview is then given of advanced methods for inverse filtering: model based, adaptive iterative, higher order statistics and cepstral approaches are examined. The advantages and disadvantages of these methods are highlighted. Outstanding issues and suggestions for further work are outlined.
منابع مشابه
Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speech
This paper presents a robust feature extraction method effective to speech signal with high fundamental frequency and/or corrupted by additive white noise. The method represents the glottal source wave using HMM in order to model the nonstationary properties. The nodes of HMM are concatenated in a ring state to represent the periodicity of voiced sounds. The method can accurately extract glotta...
متن کاملOn the importance of glottal flow spectral energy for the recognition of emotions in speech
Two new approaches to feature extraction for automatic emotion classification in speech are described and tested. The methods are based on recent laryngological experiments testing the glottal air flow during phonation. The proposed approach calculates the area under the spectral energy envelope of the speech signal (AUSEES) and the glottal waveform (AUSEEG). The new methods provided very high ...
متن کاملMaximum a Posterior Probability and Cumulative Distribution Function Equalization Methods for Speech Spectral Estimation with Application in Noise Suppression Filtering
The COST-277 speech database p. 100 Children's organization of discourse structure through pausing means p. 108 F0 and intensity distributions of Marsec speakers : types of speaker prosody p. 116 A two-level drive response model of non-stationary speech signals p. 125 Advanced methods for glottal wave extraction p. 139 Cepstrum-based estimation of the harmonics-to-noise ratio for synthesized an...
متن کاملA comparative study of glottal open quotient estimation techniques
The robust and efficient extraction of features related to the glottal excitation source has become increasingly important for speech technology. The glottal open quotient (OQ) is one relevant measurement which is known to significantly vary with changes in voice quality on a breathy to tense continuum. The extraction of OQ, however, is hampered in the time-domain by the difficulty in consisten...
متن کاملSteady Flow Through Modeled Glottal Constriction
The airflow in the modeled glottal constriction was simulated by the solutions of the Navier-Stokes equations for laminar flow, and the corresponding Reynolds equations for turbulent flow in generalized, nonorthogonal coordinates using a numerical method. A two-dimensional model of laryngeal flow is considered and aerodynamic properties are calculated for both laminar and turbulent steady flows...
متن کامل